Knowledge and Information Systems REGULAR PAPER
نویسندگان
چکیده
XML has already become the de facto standard for specifying and exchanging data on the Web. However, XML is by nature verbose and thus XML documents are usually large in size, a factor that hinders its practical usage, since it substantially increases the costs of storing, processing, and exchanging data. In order to tackle this problem, many XML-specific compression systems, such as XMill, XGrind, XMLPPM, and Millau, have recently been proposed. However, these systems usually suffer from the following two inadequacies: They either sacrifice performance in terms of compression ratio and execution time in order to support a limited range of queries, or perform full decompression prior to processing queries over compressed documents. In this paper, we address the above problems by exploiting the information provided by a Document Type Definition (DTD) associated with an XML document. We show that a DTD is able to facilitate better compression as well as generate more usable compressed data to support querying. We present the architecture of the XCQ, which is a compression and querying tool for handling XML data. XCQ is based on a novel technique we have developed called DTD Tree and SAX Event Stream Parsing (DSP). The documents compressed by XCQ are stored in Partitioned Path-Based Grouping (PPG) data streams, which are equipped with a Block Statistics Signature (BSS) indexing scheme. The indexed PPG data streams support the processing of XML queries that involve selection and aggregation, without the need for full decompression. In order to study the W. Ng (B) · W.-Y. Lam Department of Computer Science, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong E-mail: [email protected] P. T. Wood · M. Levene School of Computer Science and Information Systems, Birkbeck, University of London, Malet Street, London, UK
منابع مشابه
Knowledge Flows Automation and Designing a Knowledge Management Framework for Educational Organizations
One of an important factor in the success of organizations is the efficiency of knowledge flow. The knowledge flow is a comprehensive concept and in recent studies of organizational analysis broadly considered in the areas of strategic management, organizational analysis and economics. In this paper, we consider knowledge flows from an Information Technology (IT) viewpoint. We usually have tw...
متن کاملBehavioral Considerations in Developing Web Information Systems: User-centered Design Agenda
The current paper explores designing a web information retrieval system regarding the searching behavior of users in real and everyday life. Designing an information system that is closely linked to human behavior is equally important for providers and the end users. From an Information Science point of view, four approaches in designing information retrieval systems were identified as system-...
متن کاملInvestigating Challenges of agricultural knowledge and information systems (AKIS) in Iran with Delphi technique
Sustainable agricultural and rural development requires knowledge and information, skills, attitudes and technologies, which run through a network of actors to produce, distribute and use it in a particular place. The model of the Agriculture Knowledge and Information Systems (AKIS) is designed based on this recognition. In Iran, There are about decades of experience in the development of this ...
متن کاملتأثیر سیستمهای اطلاعاتی منابع انسانی بر اشتراک دانش با میانجیگری فرهنگسازمانی (موردمطالعه: شعب بانک ملت شیراز)
The main purpose of this study is to investigate the effect of human resources information systems on knowledge sharing through the mediation of organizational culture in Branches of Mellat Bank in Shiraz city. In this study, the key indicators for each variable of the study (human resources information systems, knowledge sharing and knowledge-based organizational culture) was given. then, they...
متن کاملThe Effect of Knowledge Management through Human Resources Information Systems on Customer Relationship Management in Aquatic Sport Centers
Objectives. The purpose of this research is to investigate the impact of knowledge management through information systems of human resources on customer relationship management. Methods. The study population included managers and employees of Mashhad's aquatic sport centres like ‘Blue Waves’, ‘Armaghan’, ‘Surging Waves’, and ‘Sunshine Coast Park’...
متن کاملA New Approach for Knowledge Based Systems Reduction using Rough Sets Theory (RESEARCH NOTE)
Problem of knowledge analysis for decision support system is the most difficult task of information systems. This paper presents a new approach based on notions of mathematical theory of Rough Sets to solve this problem. Using these concepts a systematic approach has been developed to reduce the size of decision database and extract reduced rules set from vague and uncertain data. The method ha...
متن کامل